A Graph-based Readability Assessment Method using Word Coupling

نویسندگان

  • Zhiwei Jiang
  • Gang Sun
  • Qing Gu
  • Tao Bai
  • Daoxu Chen
چکیده

This paper proposes a graph-based readability assessment method using word coupling. Compared to the state-of-theart methods such as the readability formulae, the word-based and feature-based methods, our method develops a coupled bag-of-words model which combines the merits of word frequencies and text features. Unlike the general bag-of-words model which assumes words are independent, our model correlates the words based on their similarities on readability. By applying TF-IDF (Term Frequency and Inverse Document Frequency), the coupled TF-IDF matrix is built, and used in the graph-based classification framework, which involves graph building, merging and label propagation. Experiments are conducted on both English and Chinese datasets. The results demonstrate both effectiveness and potential of the method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Readability assessment of patient education materials from the American Academy of Otolaryngology--Head and Neck Surgery Foundation.

OBJECTIVE Americans are increasingly turning to the Internet as a source of health care information. These online resources should be written at a level readily understood by the average American. This study evaluates the readability of online patient education information available from the American Academy of Otolaryngology--Head and Neck Surgery Foundation (AAO-HNSF) professional Web site us...

متن کامل

Analysis of InGaAsP-InP Double Microring Resonator using Signal Flow Graph Method

The buried hetero-structure (BH) InGaAsP-InP waveguide is used for asystem of double microring resonators (DMR). The light transmission and location ofresonant peaks are determined for six different sets of ring radii with different ordermode numbers. The effect of changing middle coupling coefficient on the box likeresponse is studied. It is found that the surge of coupling coefficient to the ...

متن کامل

Graph-based Coherence Modeling For Assessing Readability

Readability depends on many factors ranging from shallow features like word length to semantic ones like coherence. We introduce novel graph-based coherence features based on frequent subgraphs and compare their ability to assess the readability of Wall Street Journal articles. In contrast to Pitler and Nenkova (2008) some of our graph-based features are significantly correlated with human judg...

متن کامل

Assessment of online patient education materials from major ophthalmologic associations.

IMPORTANCE Patients are increasingly using the Internet to supplement finding medical information, which can be complex and requires a high level of reading comprehension. Online ophthalmologic materials from major ophthalmologic associations should be written at an appropriate reading level. OBJECTIVES To assess ophthalmologic online patient education materials (PEMs) on ophthalmologic assoc...

متن کامل

EFL Textbook Evaluation: An Analysis of Readability and Vocabulary Profiler of Four Corners Book Series

This study aimed to investigate whether there is any significant relationship between the readability and vocabulary profile including the most frequent words (K1 words) and academic word list (AWL) of reading passages of Four Corners series which were EFL textbooks. To determine the readability of the texts, the Flesch–Kincaid (1975) readability test was used, while the texts' academic word li...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015